High Dimensional Data Clustering Using Cuckoo Search Optimization Algorithm

Authors

  • Priya Vaijayanthi
  • Xin-She Yang
  • Raja Murugadoss
Abstract

The amount of data available on the Internet and the World Wide Web is increasing exponentially. Retrieving the data closest to a user's query, both effectively and efficiently, is a challenging task for an Information Retrieval (IR) system. Document clustering is one solution. Clustering is the process of partitioning a set of objects so that objects in the same cluster are more similar to one another than to objects in other clusters. The number of possible ways to cluster the documents is enormous, which makes this a combinatorial optimization problem. Nature-inspired algorithms are powerful tools for attacking this type of problem. In this paper, an attempt has been made to use the Cuckoo Search Optimization (CSO) algorithm to solve the document clustering problem. The CSO algorithm is tested on a standard benchmark dataset, Classic4. The quality of the solutions generated by the CSO algorithm, measured by the DB Index, was compared with that of the K-means algorithm and the Ant Colony Optimization (ACO) algorithm. The results reveal that the CSO algorithm is a viable way to achieve world-class solutions to high-dimensional data clustering.

Keywords: document clustering, optimization, cuckoo search, ant colony, meta heuristic
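The abstract scores clusterings by the DB Index. As a minimal sketch, assuming this refers to the Davies-Bouldin index with Euclidean distance (the abstract does not specify the distance measure or document representation), the score of a given partition could be computed as shown here. The function names `centroid` and `davies_bouldin` and the toy points are illustrative, not taken from the paper. Lower values indicate more compact, better separated clusters.

```python
import math


def centroid(points):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    dim = len(points[0])
    return [sum(p[d] for p in points) / len(points) for d in range(dim)]


def davies_bouldin(points, labels):
    """Davies-Bouldin index of a partition (assumed Euclidean distance).

    points: list of numeric vectors; labels: cluster id for each point.
    Assumes at least two clusters with distinct centroids.
    """
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        raise ValueError("need at least two clusters")

    keys = sorted(clusters)
    cents = {k: centroid(clusters[k]) for k in keys}
    # Scatter: mean distance of a cluster's points to its own centroid.
    scatter = {
        k: sum(math.dist(p, cents[k]) for p in clusters[k]) / len(clusters[k])
        for k in keys
    }

    total = 0.0
    for i in keys:
        # Worst-case similarity ratio of cluster i against every other cluster.
        total += max(
            (scatter[i] + scatter[j]) / math.dist(cents[i], cents[j])
            for j in keys
            if j != i
        )
    return total / len(keys)


# Toy data: two tight groups far apart along the first axis.
points = [(0.0, 0.0), (0.0, 2.0), (10.0, 0.0), (10.0, 2.0)]
good = [0, 0, 1, 1]  # groups nearby points together
bad = [0, 1, 0, 1]   # mixes the two groups
print(davies_bouldin(points, good), davies_bouldin(points, bad))
```

In an optimization setting such as the one the abstract describes, a function like this could serve as the objective that a search algorithm tries to minimize over candidate partitions.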

Similar resources

Improved COA with Chaotic Initialization and Intelligent Migration for Data Clustering

A well-known clustering algorithm is K-means. This algorithm, besides advantages such as high speed and ease of employment, suffers from the problem of local optima. In order to overcome this problem, a lot of studies have been done in clustering. This paper presents a hybrid Extended Cuckoo Optimization Algorithm (ECOA) and K-means (K), which is called ECOA-K. The COA algorithm has advantages ...

Gene Clustering Using Metaheuristic Optimization Algorithms

Gene clustering is a familiar step in the exploratory analysis of high-dimensional biological data. It is the process of grouping genes with similar patterns into the same cluster, and it aims at analyzing gene functions, which supports the development of drugs and the early diagnosis of diseases. In recent years, much research has been carried out using nature-inspired meta-heuristic algorithms. Cuc...

Text Summarization Using Cuckoo Search Optimization Algorithm

Today, with the rapid growth of the World Wide Web and the creation of Internet sites and online text resources, text summarization has attracted the attention of many researchers. Extractive text summarization is an important summarization method that consists of selecting the top representative sentences from the input document. When we are facing documents with large data volumes, the extr...

Web Document Clustering Using Cuckoo Search Clustering Algorithm based on Levy Flight

The World Wide Web serves as a huge, widely distributed global information service center. The tremendous amount of information on the web is growing day by day, so finding relevant information on the web is a major challenge in Information Retrieval. This creates the need for new techniques that help users effectively navigate, summarize and organize ...

Modeling and Comparison of Optimized Isotherm Models for H2, N2, CO, CH4 and CO2 Adsorption Using Cuckoo Search Optimization Algorithm

In this study, the sorption of hydrogen, nitrogen, carbon monoxide, methane and carbon dioxide on UTSA-16 framework extrudates in the pressure swing adsorption process was modeled. The pure gas adsorption of these gases was also measured in a fixed bed over the pressure range 0 to 80 bar at 298, 313, and 338 K. Langmuir, Toth, Sips, UNILAN, Virial and Dubinin-Astakhov adsorpti...

Publication date: 2014